The Cochrane Schizophrenia Group’s Register of studies details all aspects of the effects of treating people with schizophrenia. It has been gathered over the last 20 years and consists of around 20,000 documents, overwhelmingly in PDF. Document collections of this sort – on a given theme but gathered from a wide range of sources – will generally have huge variability in the quality of the PDF, particularly with respect to the key property of text searchability.

Summarising the results from the best of these papers, to allow evidence-based health care decision making, has so far been done by manually creating a summary document, starting from a visual inspection of the relevant PDF file. This labour-intensive process has resulted, to date, in only 4,000 of the papers being summarised – with enormous duplication of effort and with many issues around the validity and reliability of the data extraction.

This paper describes a pilot project to provide a computer-assisted framework in which any of the PDF documents could be searched for the occurrence of some 8,000 keywords and key phrases. Once keyword tagging has been completed the framework assists in the generation of a standard summary document, thereby greatly speeding up the production of these summaries. Early examples of the framework are described and its capabilities illustrated.
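To make the keyword-tagging step concrete, here is a minimal sketch in Python. It is not the framework described in the paper: it assumes the text has already been extracted from a searchable PDF, and the function name `tag_keywords` and its case-insensitive, whole-phrase matching rule are illustrative assumptions.

```python
import re
from collections import Counter


def tag_keywords(text, keywords):
    """Count case-insensitive whole-phrase occurrences of each keyword.

    Hypothetical helper for illustration only; `text` is assumed to be
    plain text already extracted from a PDF.
    """
    # Collapse whitespace so a phrase split across a line break still matches.
    normalised = re.sub(r"\s+", " ", text)
    counts = Counter()
    for keyword in keywords:
        # Lookarounds stop a keyword from matching inside a longer word.
        pattern = r"(?<!\w)" + re.escape(keyword) + r"(?!\w)"
        hits = re.findall(pattern, normalised, flags=re.IGNORECASE)
        if hits:
            counts[keyword] = len(hits)
    # Keywords that never occur are left out of the result.
    return dict(counts)
```

Called with the text of one paper and the keyword list, the function returns only the keywords that occur, each with its count, which could then be used to pre-fill a summary document. Looping over a list of some 8,000 keywords is the simplest approach; a real system would probably need a faster multi-pattern matcher.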